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0.1 The Organization of This Paper 

The first section starts with the basic definitions following mainly the notations of the 
book written by E. Kushilevitz and N. Nisan [8]. This whole thesis started from this book 
and some parts follow the proofs from there but I always tried to simplify and improve 
them, I even discovered and corrected a false claim. But most of the results are mine, 
supervised by Zoltan Kiraly. At the beginning of each part I will indicate if a result is not 
mine. 

At the end of the first section I examine tree-balancing. At the very end I introduce a 
function that I will use only in the later sections. 

In the second section I summarize the well-known lower bound methods and prove the 
exact complexity of certain functions. 

In the first part of the third section I introduce the random complexity and prove the 
basic lemmas about it. In the second part I prove a better lower bound for the complexity 
of all random functions. In the third part I introduce and compare several upper bounds 
for the complexity of the identity function. 

In the fourth section I examine the well-known Direct-sum conjecture. I introduce a 
different model of computation then prove that it is the same as the original one up to a 
constant factor. This new model is used to bound the Amortized Time Complexity of a 
function by the number of the leaves of its protocol-tree. After this I examine the Direct- 
sum problem in case of Partial Information and in the Random case. 

In the last section I introduce the well-known hierarchy classes, the reducibility and 
the completeness of series of functions. Then I define the class PSPACE and Oracles in 
the communication complexity model and prove some basic claims about them. 

There are plenty questions left open in this paper, I hope I can manage to make some 
progress in some of them during the forthcoming years. 

I would like to thank Zoltan Kiraly, my supervisor, for his guidance, help and inspira- 
tion. This thesis would be nowhere without him. 



1 Introduction and Tree-balancing 

1.1 Introduction 

Communication Complexity is a computational model introduced by Yao in 1979 
Since then, many papers have been written about it, more likely because of its consequences 
in applications, including VLSI theory, and because it is more tractable than computational 
complexity, it is easier to prove lower bounds. It also has a very simple definition. 

Two players, commonly denoted by A and B and called in many names from Alice and 
Bob through Alfred and Paul to Rosencrantz and Guildenstern, both hold an input from 
a finite set, x G A and y G F, and wish to compute f{x,y) where / is a given function 
known to both of them. To do this, of course, they have to communicate. They are allowed 
to send bits to each other. For example, we give both of them a number from 1 to 10 and 
ask them whether their numbers are the same or not. They can solve this, if A transforms 
her number to base two numbering system, then she sends each bit to B, who can com- 
pare it with his number and send the result back to her. This algorithm requires 4 + 1 bits. 

The last bit of their communication have to be the value of the function if the range 
of the function is {0, 1}, this is why in the case of determining the exact length of the 
necessary communication, we write it with a '+1' to avoid confusion. Another reason to 
use this notation is that if only one of them has to know the answer, then they have to 
communicate exactly one bit less. When we are interested only in the order of magnitude, 
sometimes we even omit constant multipliers. 

A more challenging task, as usual, is to prove a lower bound for the length of the worst 
case communication. In this paper we present several techniques for this. 

1.2 Basic Definitions 

In this section we follow the book ([S] pp. 3-10 and 16-19). 

The formal definition of Communication Complexity is the following: 

Definition 1.1. For a function f : X x Y ^^ Z , the (deterministic) communication 
complexity of f is the minimum length of P, over all protocols P that compute f . We 
denote it by D{f). 

Of course, we have to define Protocol: 

Definition 1.2. A Protocol is a binary tree. There is a function X — t- {0, 1} orY ^ {0, 1} 
associated to each of the internal nodes and there is an element from Z belonging to each 
leaf. If we start from the root of the tree and turn at each node at the direction determined 
by the function, we have to end in a leaf with the value f{x,y). 

It is easy to see that this is equivalent to the communication of A and B as described 
in the previous section. In the proofs we are often using this fact because it is easier to 



imagine it like that. 

In the following claim and throughout this paper, let logn denote loggn, although 
sometimes we mean [log2 n] or [log2 n\ , the reader may always easily figure out which. 

Claim 1.3. D{f) < log |X| + log \Z\ 

Proof. A sends a 0-1 sequence that encodes her input (in a predetermined way), then B 
computes the value of / using his unlimited computational power and sends back the result 
to her in a 0-1 sequence. D 

From now on, Z will be {0, 1} because these are the most studied functions and the 
ones we are dealing with in this paper. We also assume, unless we state otherwise, that 
X = Y = {0,1}". 

We can imagine / as a matrix; the rows are representing the inputs of A while the 
columns are representing the inputs of B. The entries are the values associated to the 
corresponding inputs. 

Now we define some of our favorite functions: 
Definition 1.4. 

• EQ{x,y) = 1 zjfx = y. 

• NE{x,y) = Oiffx = y. 

• GT{x,y) = 1 iffx>y. 

• IP{x,y)= {x,y) mod 2 = ^-Xiyi mod 2. 

• DISJ{x,y) = 1 iffY^t^iVi = 0- 

. F{x,y) = OiffF{x,y) = l. 

Because of Claim [TT^ D{f) < n + 1 holds for all /s whose range is {0, 1}, thus for all 
of the above functions. 

It is easy to see that in each node of the protocol-tree one of the players, whose turn 
it is to speak at that node, splits one's set of inputs into two parts. This is equivalent 
to dividing the set of rows (or columns) into two. So after each step, we have another 
communication problem to solve, that has a smaller matrix formed by the remaining rows 
and columns of the players. 

Definition 1.5. A set R of entries is a rectangle in a matrix iff R = I x J where / C X 
is a set of rows and J (1 Y is a set of columns. A rectangle is monochromatic if all of its 
entries are the same. We call these respectively 0-rectangle and 1-rectangle. 

Claim 1.6. The input-pairs leading to the same leaf in a protocol form a monochromatic 
rectangle. Moreover, these rectangles partition the matrix. 



Proof. We prove by induction that in each step of the protocol the input-pairs leading 
to that node form a rectangle. This is true since in each node of the protocol-tree one of 
the players is splitting her remaining inputs into two disjoint sets, both giving a rectangle 
because of the induction. The rectangle of the leaf is monochromatic because the protocol 
computes /. 
Each entry belongs to exactly one leaf, thus we get a partition indeed. D 

We call the leaves that lead to a 0-rectangle 0-leaves and those that lead to a 1-rectangle 
1-leaves. 

Note that not all rectangle-partitions can be associated to protocols. To distinguish, 
we need the following Definition: 

Definition 1.7. 

• The protocol partition number of f , C^{f), is the smallest number of leaves in a 
protocol-tree that computes f . 

• The protocol 0-partition number of f , C(f (/), is the smallest number of 0-leaves in a 
protocol-tree that computes f . Similarly, Cf (/) is the smallest number of 1-leaves. 

• The partition number of f , C^{f), is the smallest number of monochromatic rectan- 
gles that can partition the matrix of f . 

• The 0-partition number of f , C(f'(/), is the smallest number of 0-rectangles in a 
monochromatic rectangle partition of the matrix of f . Similarly, C^{f) is the smallest 
number of 1-rectangles. 

• The cover number of f , C{f), is the smallest number of monochromatic rectangles 
that cover all the entries of the matrix of f . 

• The 1-cover number of f , Ci{f), is the smallest number of 1 -monochromatic rect- 
angles that cover the 1 entries of the matrix of f . Similarly, Co{f), is the smallest 
number of 0-monochromatic rectangles that cover the entries of the matrix of f . 

The following inequalities obviously follow: 

Claim 1.8. Cif) < C^{f) < C^if), 

Coif) < C(/) < C^if), 
Ciif) < Cni) < C[{f), 
C^{f) + Cr{f)<C^{f), 

c^{f) + cF{f) = c^{f), 

Co{f) + C^{f) = C{f). 

We can give a lower bound for D{f) by the logarithm of any of these values. This is 
the corollary of the previous and the following claim: 

Claim 1.9. D{f) > 1 + logCf (/). 



Proof. We split each leaf that is not at the bottom (the deepest part) of the tree into a 
1-leaf and a 0-leaf. Now the depth did not increase and we have at least 2Cf (/) leaves. A 
binary tree with depth D(f) can have at most 2^^-^^ leaves, so we have 2^^-^^ > 2C[{f), 
just what we wanted. 

D 

Corollary 1.10. D{f) > l + \ogC^{f). 

Corollary 1.11. D{f) > \ogCp{f). 

The most important of these values is C^{f) because D[f) = Q{logC^{f)). This can 
be proved with a tree balancing Lemma. 

1.3 Tree- balancing 

Here the first lemma is from the book ([8] pp. 19-20), the other one is our result. 

Lemma 1.12. \ogC^{f) < D{f) < 31ogC^(/). 

Proof. The first inequality follows from the previous claim. 

To prove the second one, we have to construct a shallow protocol from a protocol-tree with 

C^{f) leaves. We prove by induction on the number of leaves that D{f) < 31ogC^(/). 

Claim 1.13. In a tree there always exists a vertex, such that leaving out this vertex, all 
components of the remaining graph have at most half of the original leaves. 

Proof. If an edge splits the leaves into two even parts, we are done. Otherwise, direct each 
edge toward the bigger part. There must be a node with outdegree 0, this will do. D 

We choose such a node in our protocol-tree. A and B both send a bit indicating 
whether their inputs allow the path to the node, that means whether their input intersects 
the rectangle of that node. (We say that an input intersects a rectangle, if the row (column) 
of the input is the row (column) of the rectangle.) If one of their inputs does not intersect 
the rectangle, we leave out this node and everything under it, we have halved the number 
of nodes with 2 bits. If both of their inputs intersect the rectangle, then we can restrict the 
rest of the game only to this rectangle. The person who has to speak at this node sends 
one more bit, now we have halved the number of leaves (because of the splitting property 
of the chosen node) with 3 bits. Thus we can halve the number of leaves in both cases with 
at most 3 bits of communication, this completes our proof. D 

The exact relation between C^{f) and C^{f) is yet unsolved although Kushilevitz et 
al. [7j showed that a small gap (2C^(/) = C^{f)) is possible. 

We have another useful lemma about balancing protocol-trees that does not hold in 
most computational models but it does in communication complexity. We can prove that 
Cf(/) and C(f (/) can differ only in a constant factor, so communication complexity is 
result-balanced, we have almost the same number of both type of outcomes. We prove this 
here using only certain local transformations of the protocol-tree, this gives the following 
result: 
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Lemma 1.14. CQ{f) < 4Cf (/) — 2 unless f is constant. 

Proof. In fact we prove a somewhat stronger statement, that this holds for any protocol- 
tree without unnecessary nodes. Let T be an arbitrary protocol-tree. We denote by L[T) 
the number of leaves, Li{T) {Lq{T)) denotes the number of 1-leaves (0-leaves). We denote 
by T^ the tree that we get by chopping the tree T at its node z, so the nodes of the new 
tree are z and its descendants. We denote by zi and Zr the children of z. We prove by 
induction on L{T). In fact we are going to use that Lq{Tz) < 4:Li{Tz) — 2 holds for all z 
non-leaf descendants of the node v and from this we obtain Lq{T„) < 4Li(T„) — 2 or we 
find an unnecessary node. For any node z we have one of the following cases: 

(1) Both of Zi and Zr are leaves; one of them have to be a 0-leaf, the other a 1-leaf or 
z would be unnecessary. This implies Lq{Tz) < 4Li{Tz) — 3. 

(2) None of Zi and Zr is a leaf; from the induction we have Lq{Tzi) < 4:Li{Tzi) — 2 and 
LoiTj < AL,{TJ - 2. This implies Lo(T,) < 4Li(T,) - 4. 

(3) One of zi and Zr is a 1-leaf; the induction on the other child implies Lq{Tz) < 
4(Li(T,)-l)-2 = 4Li(T,)-6. 

(4) One of zi and Zr is a 0-leaf. 

Now we can follow a straightforward argument. 

If for V we have one of the first three cases, we are done. So the only interesting case is 
when one of the children of f is a 0-leaf while the other is not a leaf. Wlog, we can suppose 
that at V it is A's turn to speak and we denote the non-leaf child of v by w. 





Figure 1: Trivial reduction 



Again, for w we must have the fourth case or we are done. If at w it were also A's turn 
to speak, then she could merge v and w, so one of the nodes is unnecessary. (See Figure 
[T],) So at w it is B's turn to speak. We shall denote the non-leaf child of w by u. 

If for u we have the second or the third case, we are done. Both in the first and the 
fourth case one of u-s children is a 0-leaf. If it is B's turn to speak, he could merge w 
and u (like in Figure [T]f and w were merged by A), hence we had an unnecessary node. 
If it is A's turn to speak, then she could merge v and u, and B can speak what he had 
to at w after this node, hence again one of the nodes would be unnecessary. (See Figure [J!) 




Before 




Figure 2: Reduction with one skip 



This completes the proof, so we gained LqIT^) < ALi{T^) — 2. This holds for all nodes, 
for the root of tree as well, so we have proved C(f (/) < AC[{f) — 2. D 

Of course, we can switch the roles of 0- and 1-leaves. 
Corollary 1.15. C[{f) < 4C(f (/) — 2 unless f is constant. 

We also obtained the following sharpening of Claim 11.91 

Corollary 1.16. Z^(/) = 0(Cf (/)). 

Note that none of the above lemmas is sharp, we do not know the exact values. For 
the latter, we can prove a sharper upper bound, that gives 3/2 instead of 4. This is done 
in the following section. 



1.4 Advanced Tree-balancing 

On a protocol-tree P that computes an arbitrary function we are going to make certain 
transformations that increase neither the number of 0-leaves, nor the number of 1-leaves, 
and do not even increase the depth of the tree. If none of these transformations can be 
made on P, then we can prove that the number of l-leaves is very close to the number of 
0-leaves. Of course, this implies the same for C[{f) and C(f (/). 

We call a node A-node if at the node it is A's turn to speak. We call a leaf A-leaf if at 
the father of the leaf it is A's turn to speak. We similarly define B-node and B-leaf. 
The magnitude of a leaf is the number of leaves under its father. Hence it is always at least 
two. 

The idea is to push down the 0-leaves in the tree as deep as possible. Eg., if an A-node 
has a 0-leaf hanging from it and its other son is an A-node as well, we can switch the two 
nodes. (See Figure [Si) So we can assume that if a 0-leaf is hanging down from a node, 
then at the other son of that node it is the other person's turn to speak. 





Figure 3: Push down through own node 



Can we push down a 0-leaf even deeper? Let us see what happens if we switch the 
father of an A-O-leaf with its son! Now unfortunately a new 0-leaf was created (see Fig- 
ure |4j) but the depth of the tree did not increase. We keep trying to push down these 
0-leaves, sometimes duplicating them again when pushing through a node of B. Some get 
eliminated (when meeting another 0-leaf), the others reach the bottom of the tree. The 
last node that they pass must be a B-node with a 1-leaf, otherwise we could push our leaf 
even deeper or eliminate our 0-leaf with an other 0-leaf. If at most one 0-leaf reached the 
bottom, this transformation did not increase the number of 0-leaves, thus we can execute 
it. If at least two 0-leaves would reach the bottom, we do not perform the transformation 



but we associate the B-1-leaves that we have reached with our original A-O-leaf. Note that 
we do not associate a B-l-leaf to two different A-0-leaves; the path leading from the higher 
one would go through the parent of the other but it is impossible because there it should 
have been eliminated. 





Before 



After 



Figure 4: Push down through other's node 



We perform the above operations as many times as we can. We would like to prove 
that we do not get into an infinite loop; we need something that strictly decreases after 
each transformation. If a leaf was eliminated, we have no problem, the number of leaves 
decreased. If not, let us consider the sum of the magnitudes of 0-leaves. 

Wlog, let us assume that an A-O-leaf is being pushed down. During the push down 
through an A-node, this sum will strictly decrease; its only element that changes is the 
magnitude of our A-O-leaf, it will decrease by the number of leaves in Y. (See Figure [Si) 
During the push down over a B-node, this sum cannot grow; only one of the two new 0- 
leaves survives and it has fewer 0-leaves under it than our original A-O-leaf had. However, 
it is possible that this sum remains the same, if one of the children of the B-node is a 
0-leaf. But in this case, we can push down either this B-O-leaf or our new A-O-leaf through 
the other son of the B-node (depending whether it is a B- or an A-node) unless it is a 
1-leaf. This is the only configuration that is wrong. (See Figure [51) We are going to call 
this configuration and the one we can obtain from it by switching the node of A and B, 
a twist and denote it by T. (These two can be obtained from each other, hence we do not 
need to distinguish them.) 

If we encounter a twist, we do not perform the transformation. Therefore we cannot 
get into an infinite loop. 

If we can perform none of the above operations, it means that every A-O-leaf whose 
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Figure 5: The Twists 



brotlier is not a 1-leaf, and wlio is not in a twist, has at least two B-1-leaves associated to 
it. We will call the paths leading from the father of the A-O-leaf to the B-1-leaves A-paths. 
We can similarly define the B-paths. 

It is high time to introduce some notations: 

We denote the number of twists by T. 
We denote the number of A-0-leaves by ag. 

We similarly denote the number of A-1-leaves by ai, the number of B-0- leaves by bo and 
the number of B-1-leaves by bi. 

We denote the number of A-0-leaves who are not in a twist and whose brother is also a leaf 
by Og"''^. We denote the number of A-0-leaves who are not in a twist and whose brother is 
not a leaf by «[,. We similarly define this for the other type of leaves as well. (Note that in 
each twist we have an A-O-leaf, a B-O-leaf and a 1-leaf, and none of them belongs to the 
end Qj^ ^Q ^Yie ' part. The 1-leaf can belong to any of the players, we can arbitrarily switch 
the nodes of the twist as we want to, so it makes no difference where we count the 1-leaf.) 
If we do not want to specify whether it is an A- or a B-leaf, we denote the corresponding 
number by L indexed by the appropriate things. Eg., ai + bi = Li. 

Hence we have T + a^o'^^ + a'Q = oq, T + b^J"^ + b'Q = bo and T + af"^ + a[+bf"^ + b\ = ai+bi. 
Because of the associations, we have bi > 2aQ and ai > 26g. Moreover, it is easy to see 
that an A-path or a B-path can not lead to a twist-leaf. (If it did, the 0-leaf would be 
eliminated by the 0- leaves of the twist.) Hence ai + bi — T > 2aQ + 26q. 
We also know that a^"'^ = Oq"'' and bl"''^ = 6q"'^ unless / is constant, but let us exclude this 
trivial case. 

Putting these all together we get ao + bo = T + ag"°' + a'^ + T + b"^'^ + b'^ < 2T + \{ai + 
bi-T)+ af '^ + 6f '^ = \Li + |T + Lf "^ < |Li + |(T + Lf '^) < 2Li. This immediately 
leads to the following better improvement of Lemma 11.141 for all nonconstant /s: 

Lemma 1.17. C(f (/) < 2Cf (/) unless f is constant. 
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Now all we have to do is reduce that constant | in front of T to 1 to obtain the desired 
upper bound. Note that this does not hold if the whole protocol-tree is a single twist. 
However, miraculously we can use the twists to prove the bound for every other tree. So 
let us suppose that our protocol-tree has more than four leaves, so it is not constant or a 
single twist. 

We are going to associate a non-twist 1-leaf that is not associated to any 0-leaf to each 
twist. Let us fix a twist, T*. We can suppose wlog, that the father of T* is an A- node. In 
this case there can be no A-path through this node. (If there were, the A-O-leaf could be 
eliminated by turning into the twist at its father.) Furthermore, we can start a twist-path 
going down from the father of T*; it is equivalent to hypothetically moving the A-O-leaf 
out of the twist (switching its father with the father of the twist) and then moving it 
down, exactly how we did with an ordinary A-O-leaf with the push down operations when 
we created the A-paths. The only exception is that we do not move it deeper; we only 
check whether it can be eliminated or not. If it can be, we reduced the number of 0-leaves, 
we are happy. If not, then we can associate it to a non-twist, so-far-not-associated B-l-leaf. 

This gives us the following inequality: T -|- 2aQ -|- 26q < ai + &i — T if L > 3. 
This leads to the improved inequality aQ + bo < |Li +T + L^'^ < |Li if L > 3. 
So we can state our final version of the result-balancing lemma: 

Lemma 1.18. C^{f) < lC(U) -unless C^U') < 3- 

Corollary 1.19. Cf (/) < |Co^(/) unless C^{f) < 3. 

However, it still remains an open question whether the | can be improved or not. Even 

^oif)/^iif) — ^ -'- i^ possible, but we do not think that is likely. 

1.5 A Technical Function 

Here we define a real function that comes up often in this paper and we give an esti- 
mation for it. We denote the inverse of a function by ^~^\ 

Definition 1.20. A(s) = 2'+^°^' = s2' , A(t) = A^-^\t). 

The estimation easily follows from the definition: 
Claim 1.21. log t- log log t < A(t) < logt. 

In fact with the help of this A function, we can get a formula for any similar function: 
Claim 1.22. (s + a)2' = 2-''A{s + a). 

Proof. A(s + a) = {s + a)2"+" = 2"(s + a)2'. D 

Corollary 1.23. ((s + a)2")(-i)(t) = A(t2'*) - a. 
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2 Lower Bounds 

There are three general methods to give a good lower bound for D{f). This whole 
section follows the book ([8] pp. 10-14) except Claim I2TT5] and the proving of the exact 
value for IP from it are our results. Theorem 12. Ill is a slightly improved version of the one 
in the book, this version is from jlOj . 

2.1 Fooling Sets 

Fooling sets are implicitly used in [,16j by Yao and defined in [9J by Lipton and 
Sedge wick. 

Definition 2.1. The elements of a 1-fooling set 5*1 are input-pairs {x^'^\y^'^^) with the 
following two properties: \/i f{x^^\y^'^^) = 1 andWi =^ j f{x^^\y^^^) = or f{x^^\y^'^^) = 0. 
Similarly, Sq is a 0-fooling set if Wi /(a;*^*\ y^*^) = and \/i ^ j f{x^''\y^^'^) = 1 or 
/(a;(^-),2/«) = l. 
We call a set S a fooling set if it is a 1-fooling set or a 0-fooling set. 

Claim 2.2. Ci(/) > \Si\ if Si is a 1-fooling set for f. 

Proof. It is enough to show that each element of a fooling set is in a different 1-rectangle. 
Indeed, if two elements were in the same rectangle, it would not be monochromatic. D 

Corollary 2.3. Co(/) > 15*01 if Sq is a 0-fooling set for f. 



Combining this claim with Claim [L9] we get: 

Corollary 2.4. D{f) > 1 + log \S\ if S is a fooling set for f . 

With the help of this fact, we can prove that our upper bounds were tight for EQ, NE, 
GT and DISJ. 

Claim 2.5. D{EQ) =n + l. 

Proof Si = {{x,x) : X e {0, l}'^} is a 1-fooling set. Thus D{EQ) > l + log(2") = n+1. D 

Corollary 2.6. D{NE) =n + l. 

The same set is a fooling set for GT as well. 

Corollary 2.7. D{GT) =n + l. 

Claim 2.8. D{DISJ) = n + l. 

Proof Si = {{x,l-x) -.xe {0, 1}"} is a 1-fooling set. Thus D{DISJ) > n + 1. D 

However, for IP we cannot find a sufficiently large fooling set. (We could not prove that 
there are not any but we think so.) For this, we need different lower bound techniques. 
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2.2 Rank Lower Bound 

This lower bound was discovered by Mehlhorn and Schmidt [12]. We denote the matrix 
associated to / by Mj, the rank of the matrix M by r{M). The proof of their theorem is 
based on the following simple fact well-known from linear algebra: 

Claim 2.9. M = A + B ^ r{M) < r{A) + r{B). 

We can imagine each step of the protocol as cutting the matrix M into two smaller 
matrices. We denote these by Mq and Mi. 

Corollary 2.10. r{M) < r{Mo) + r(Mi) 

Proof. If we add Os to Mq and Mi to make them as big as M was, we get two matrices, A 
and B, for which r{A) = r(Mo) and r{B) = r(Mi) and A + B = M. Now we can use the 
previous claim and we are done. D 

So in each step of the protocol the rank of one of the remaining matrices is at least 
half the rank of the previous matrix. Furthermore, we can assume that it has at least 
one entry (unless / is constant 1 but we can exclude this case). At the end, we get a 
0-monochromatic matrix whose rank is 0. Therefore we derived: 

Theorem 2.11. D{f) > 1 + logr(Mj) unless f is constant. 

With the help of the theorem, we can give a lower bound for D{IP). 
Mip ■ Mjp = Mfp is a very simple matrix; the first row and column are all Os, the diagonal 
of the rest is filled with 2"~^ and all the other entries are 2"~^. Its rank is 2"^ — 1. From 
the well-known linear algebraic fact that r{AB) < r{A), it follows that r{Mjp) > 2" — 1. 
From the theorem we got D[IP) > 1 + n. This is exactly what we wanted. 

It is a major open question, also known as the log rank conjecture whether D{f) = 
(log r(Mf))'^^^\ whether the complexity can be bounded from above by a polinom of 
log r(Mj-). The largest gap has been showed by Nisan And Wigderson in p,3j. They exhibit 
a function for which D{f) = n{n) and logr(M/) = 0(ni/'°s3) ^ Q(^^o.63i...y 

2.3 Discrepancy 

This lower bound has a parameter, a probability distribution fi over the elements of 
Mf. If fi is the uniform distribution, it equals to the number of elements in a given set 
divided by 2^". Let us denote the maximum of the measures of monochromatic rectangles 
by w. 

Claim 2.12. C^(/) > 1/w 

Proof. Each leaf can have measure at most w, the leaves partition the matrix, the matrix 
has measure 1, we are done. D 

If the measure is concentrated only to the 1 (or 0) entries of the matrix, we denote it 
by fii (or yUo). The maximum of the measures of 1-rectangle's (or 0-rectangle's) is denoted 
by Wi (or wq). With the same prove as before we get the following lower bounds: 
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Claim 2.13. C^{f) > I/wq and Cf{f) > l/wi 

Now let us examine how we can apply these bounds on IP. 

Claim 2.14. If R '^ Mjp is a 0-rectangle, then \R\ (the size of R) is at most 2^. 

Proof. We will use linear algebra. We imagine the inputs as vectors over GF{2). We denote 
the set of rows (or columns) of i? by A (or B). Because i? is a 0-rectangle, we have 
AJ-B. This implies that dim (A) + dim (B) (the sum of the dimensions of their generated 
subspaces) is at most n. Therefore |i?| = |y4| ■ |i?| < 2". D 



n-l 



Claim 2.15. If R '^ Mjp is a 1-rectangle, then \R\ (the size of R) is at most 2 

Proof. Like in the previous proof, we are using linear algebra again. We put an extra 1 
digit to the end of each vector to make them pairwise perpendicular. We denote these new 
sets of vectors by A' and B'. Fix an arbitrary element a G A'. Note that (a + A') (1 A' = 
because a + A' always ends with a 0. Therefore | {A') \ > 2\A\ and similarly | {B') \ > 2\B\. 
dim {A') + dim {B') <n + l implies 2"+^ > | {A') \ ■ \ {B') \ > 2\A'\ ■ 2\B'\ = A\A\ ■ \B\ = 
A\R\. D 

From any of the above claims we can prove D{IP) > n + 1 with the help of the dis- 
crepancy lower bound. The number of Os in Mjp is 2^"^^ + 2"~^ while the number of Is is 
22n-i _2n-i^ Let us take the uniform measure /iq concentrated only to the entries of Mip. 
The largest 0-rectangle's measure is at most 2"/(2^"~^ -t- 2"'"^) < 2~"+^. Therefore from 
Claim [213] we have C^{f) > 1/2-"+^ = 2"-^. From Corollary [HU] we have D{IP) > n. 
In a similar argument we could have used fii and 1-rectangles as well, that gives Ci{f) > 
2^ — \ and the same lower bound. So we have proved again D{IP) = n + 1. 
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3 Random EQ 

3.1 Basic Definitions 

We distinguish between two types of random protocols; whether the random coin flips 
are private or public. This means whether each of them can generate a random string for 
oneself only or there is a random string that both of them can see. In this paper we are 
only dealing with bounded error protocols, for unbounded errors see [H]. This part follows 
the book ([8J pp. 28-34) but Claim 1X51 what is stated in a false form in the book (pp. 30) 
was corrected. The notations are: 

Definition 3.1. 

• ForO < e < 1/2, Re{f) is the minimum height of a (private) randomized protocol that 
computes f with error < e. We also denote i?i/3(/) by R{f)- This is called (two-sided 
error) probabilistic protocol. 

• For < e < 1, R\{f) is the minimum height of a (private) randomized protocol that 
computes f with error < e if f{x,y) = 1 and makes no error at all if f{x,y) = 0. We 
also denote Rl,^{f) by R^{f). This is called one-sided error probabilistic protocol. 

. i?°(/) := Rlif) 

• In case of random public coins, we denote the similar values respectively by R^^^{f), 
R\'P''\f) andRy''\f). 

If we have a one-sided error protocol, we can get one with smaller error probability but 
with two-sided error: 

Claim 3.2. /2./(i+.)(/) < RlU), KTii^.)U) < Rl^'-'V) 

Proof. The protocols differ only at the very end; when the last bit (the result) is sent, 
sometimes instead of a (what might be wrong), they might send a 1. If the output of the 
one-sided protocol was 0, they will output with probability a in the two-sided case. (So 
this implies, that the mistake will be 1 — a if f{x,y) = 0.) If f{x,y) = 1, then they will 
answer 1 with probability at least 1 — e-|-e(l — a) = 1 — ea, so the chance of making a 
mistake is < ea. If we set ea = 1 — a, then we get the optimal result because the chances 
of making a mistake are equal for both values of /. So a = 1/(1 + e), this gives e/(l + e) 
chance of error. D 

Corollary 3.3. R{f) < R^{f), i?P"^(/) < R^'P'^^f). 

Of course, Re{f) and Rl{f) depend on e only up to a constant factor. It is easier to 
prove for the one-sided error case, we start with this. 

Claim 3.4. For0<6 <e<l, Rjif) < Rl{f) log 6/ log e. 
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Proof. If we have a protocol for e, all we have to do is to repeat it a few times to get one 
for 6; if the answer is 1 in any of the cases, we know for sure that f{x,y) = 1 indeed. If 
all the answers are 0, then we answer 0. It is clear that this is a one-sided error protocol. 
What is the chance that f{x,y) = 1 but all our answers are after t repetitions? It is at 
most e*. So it is enough to repeat our protocol log^ 6 times. D 

Claim 3.5. For < 6 < e < 1/2, Rs{f) < 0(^^^^^). 

Proof. If we have a protocol for e, all we have to do is to repeat it a few times to get one 
for 6; after repeating it t times, we answer what the majority of the t answers were. (For 
simplicity, let us suppose that t is odd.) What is the chance that our algorithm is mistaking 
at least t/2 times if we are repeating it t times? Let us denote the indicator of the event that 
the ith answer is wrong by Zj. By Chernoff' s inequality, Pr[{l/t X]i=i -^j""^) ^ x] < e~^ */^. 
In our case the answer will be the majority of the answers, thus we want to choose x as 
large as possible, to get the best bound achievable this way, but satisfying that x + e < 1/2 
(because of this the chance of a wrong answer will be smaller than the right-hand side). 
This means of course x = 1/2 — e. We have to choose t such that the right-hand side equals 
S. This yields t = -2 log, 5/(1/2 - e)^. D 

Corollary 3.6. For < e < 1/2, R,{f) < 0{R{f) log 1/5). 

A similar statement holds for the public coin model as well. More surprisingly, it is 
even possible to switch from the private coin model into the public coin model for the cost 
of log n bits of communication: 

Claim 3.7. For < 6,e,6 + e < 1/2, Re+sif) < i?r^(/) + logn + 2 log 1/5 + 0(1). 

Proof. The idea is that A sends the results of her private coinfiips to B so they can simulate 
public coins. We only have to show that for each public coin protocol we can construct 
another one using only a few flips without significantly increasing the chance of mistake. 
For the input-pair {x, y) and (public, unknown length) random string r let Z{x, y, r) denote 
the indicator of the event that our answer is wrong. For any fixed x and y, what is the chance 
that picking t random strings (out of all possible random strings) at least (e + 5)t of our 
answers are wrong? By Chernoff's inequality: Pr[{l/t^^^^ Z{x,y,ri) — e) > 5] < e~^ */^. 
If we choose t = 0{n/S'^), then the right hand side is smaller than 2~^". This means that 
there are t such random strings that for all x and y this probability is smaller than 1, so if 
we choose instead of all random strings only from this t, the chance of making a mistake 
will be smaller than e + 6. Sending which random string was chosen requires transferring 
logt = logn + 2 log 1/5 + 0(1) bits, this completes our proof. D 

Note that we cannot get rid of this logn because R{EQ) = 0(logn) while Rp^'^(EQ) = 
1 + 1. We will show the constructions in the next section, here we include another useful 
lemma that bounds the gap between random and deterministic communication complexity: 

Lemma 3.8. R{f) = n{\og D{f)). 

Proof It is sufficient to prove the following: D{f) < 2^^(^)(log 1/(1/2 - e) + Re{f)). So we 
have to construct a deterministic protocol from a random one. The basic idea is that for 
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all the possible 2^"'^^^ communication-strings A calculates the chance of the string from 
her side] this means she goes through all the bits of the string and each time when the 
communication before that given bit implies that it is her turn, she calculates the chance 
that she would send that given bit and multiplies these probabilities. Then she sends this 
real number to B. After this B can calculate, with his probabilities attached to each string, 
the chance of each possible communication-string. Summing these for the cases that give 
a 1 answer, he gets either at most e or a least 1 — e, so he will know for sure whether the 
answer is or 1. The only problem is that she cannot send the exact values because they 
are real numbers, so she has to round them. We have to determine how accurately she 
should send these real numbers to have a small rounding mistake. 

It is enough to send log 1/(1/2 — e) + Re{f) bits of accuracy each time, doing so the 
rounding mistake at each bit-string is at most 2^^^°^^^^^^^''-^^^^^^^^ = (1/2 — t)/2^^^f\ thus 
the total rounding mistake for all the 2^^^"^ cases together is less than 1/2 — e. If we add 
this to the original chance of error what was e, it is still less than 1/2, so B can decide 
safely whether the answer is or 1. D 

3.2 Lower Bounds 

In this section we mainly deal with the EQ function but sometimes our lower bounds 
hold for other functions as well. This section is completely our result. The easiest way to 
obtain a lower bound for EQ is to simply apply Lemma [3.81 using D{EQ) = n + 1: 

Corollary 3.9. R{EQ) = fi(logn). 

If we have a closer look at that proof, we can see that in fact there we derived the 
following lower bound: 

Claim 3.10. RiEQ) > \og{n + 1) - loglog(6(n + 1)). 

Proof. In Lemma [3.81 we have proved: D(f) < 2-^'*^'^) (log 1/(1/2 — e) + R^{f)). This gives 
in our case: n + 1 < 2^^^^\\ogQ + RiEQ)) thus by Corollary OS we have R{EQ) > 
X{6{n + 1)) — log 6, by Claim [T2T] we are done. D 

There is a completely different way to prove a similar lower bound. First we prove it 
only for the one-way case (this means that only A is speaking until the end, then B tells 
the result) because it is easier to understand, then for the general case. 

Claim 3.11. R'^™\EQ) > logn. 

Proof. Let us suppose that the chance of making a mistake is at most e and that A says 
less than logn bits. This means that there are only m < 2'°^" = n possible bit-strings. 
For each of them B has some probability to say or 1. We denote these probabilities by 
qi{y), . . . , Qmiy) if his input is y. For fix y each i either qi{y) > 1/2 or qi{y) < 1/2. This is 
2*" < 2" possibilities. So there are x, y inputs such that for all i both of them is bigger or 
at most 1/2, therefore \qi{x) — qi{y)\ < 1/2. 



Let us suppose that A's input is x. If B's input is also x, he has to say 1 with probabihty 
at least 1 — e but if his input is ?/, he can say 1 with probability at most e. Let us denote 
the chance that A sends the ith. possible bit-string if her input is x by pj. Then we have: 

1 - e - e < ^p^qi{x) - ^PiQiiy) = ^Pi{qi{x) - qi{y)) < ^pil/2 = 1/2 

So e > 1/4 if the communication ends after less then logn bits. This proof works even if 
e = 1/4 by examining a few cases however we omit this part of the proof because it is not 
interesting and our next claim will be stronger. D 

Claim 3.12. R{EQ) > logn - loglogra. 

Proof. We will use the well known fact that (1 — 1/n)"' > 1/4. 

Let us suppose that we have a given random, private coin protocol that finishes in d 
steps and errs with probability at most e. The basic idea is the following: If for a fixed pair 
of inputs X, y, A sends or 1 with the same probability in each step, then the chance that 
the communication will differ is small. 

Let us suppose that we are in the zth step of communication. So far i — 1 bits have 
been sent, this is 2*"^ possibilities. If it is B's turn, A is not doing anything; if it is A's 
turn, then she has a function that tells the chance for each input of sending a or a 1. 
We only consider whether this chance is between and 1/t or between 1/t and 2/t etc. 
where t is a parameter to be fixed later. This is t^ ' possibilities. Through the whole 
communication, even if she is speaking all the time (like usually women do), this gives 
only ni=i t'^' possibilities. If this is smaller then 2", then we have two inputs, x, y, that 
are the same with probability at least (1 - 1/t)"' = ((1 - 1/t)*)'^/* > (1/4)^/* =: 5. If B's 
input is X, then the result has to be 1 with probability at least 1 — e if A's input is also 
X but it can be 1 with probability at most e if A's input is y. Therefore 1 — 6 > Pr[the 
communication is different for (x,x) and {y,x)\ > (1 — e) — e, thus 2e > 6. If we want to 
prove for e, we should choose t such that 6 = (1/4)'^/* = 2e. This implies t = ^^ '^ ^^ = j^-^- 

Therefore from 2" < [iti ^^''"'^ = t^^'-^' < ^2^ = {^f\ we get a lower bound for 
d if the error is at most e. In the default e = 1/3 case, we get 2'^ < (jjt^Itj)^ < (2'^)^ , 
so n < 2'^\og{2d) < A{d), thus we get d > \{n) > logn — log log n. So we have proved: 
R{EQ) > logn — log logn, we are done. 

Note that in fact for any fixed e, we have that n < A{d) if n is big enough, thus 
R^{EQ) > logn — log logn if n is big enough. D 

Note that in the proof we did not use anything about EQ except that all rows are 
different. So our lower bound holds for all functions, except those that have the same row 
twice, but we can suppose that this does not happen, otherwise we could simply forget one 
of the rows. 

Theorem 3.13. R{f) > logn — log logn if all the rows of Mf are distinct. 



This gives a slightly better bound than Lemma [3.81 and much better if D(f)< n. 
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3.3 Upper Bounds 

The most interesting is that although we know several random protocols that run in 
O(logn) time and compute EQ but none of them is truly constructive; they all either 
contain a big, fixed random set or need to find a big random prime. This might be because 
in the proof of Claim [3771 we used Chernoff' s inequality and that is why we cannot transform 
the public coin protocol that needs only 2 steps into an explicit one. In this section we give 
a new algorithm and compare it with the ones previously known. 

Claim 3.14. BP''\EQ) = 1 + 1. 

Proof. Obviously, it cannot be 1, because then the first player should output the answer 
without the second player speaking anything. Now we present two constructions that give 
R^'^'^^{EQ) = 1 + 1. We can turn these one-sided error protocols into two-sided ones using 
Corollary 13. 3 [ so we will be done. 

Partitioning Construction: We need 2" public random bits. We think about them as 
the characteristic vector of a subset of {0, 1}". A sends to B whether her input is in the 
set or not. Now B computes. If only one of their inputs is in the set, then the answer must 
be 0. If both of their inputs are in or out of the set, he answers 1. If their inputs were the 
same indeed, then he answered 1, so they made no mistake. If their inputs were different, 
then with probability 1/2, only one of the inputs was an element of the set, so their answer 
is with probability 1/2, just what we wanted. 

Inner Product Construction: Here we need only n public random bits denoted by z, we 
think about them (and also about x and y) as a vector over GF{2). A sends to B {x, z). B 
compares it with {y, z). If they differ, he answers 0, because x and y must be different. If 
they are the same, he answers 1. We only have to prove that if they differ, then the chance 
that {x, z) and (y, z) also differ is 1/2. Fix a bit where x and y differ. With probability 
1/2, this bit of z equals 1, with probability 1/2, this bit of z equals 0. Changing only on 
this bit and leaving the other bits unchanged, exactly one of their scalar products change. 
This completes the proof. D 

Combining this result with Claim 1X71 and with the lower bound, we get: 

Corollary 3.15. R{EQ) =logn + 0(l) 

Comparing this with the lower bounds, this is almost the best that we can get. How- 
ever, this is not a constructive proof, it only shows that there exists a proper algorithm, 
but we have no clue how to construct it. In the remaining part of this section, we present 
some algorithms that are more constructive but give worse bounds. 

Another way to achieve a similar upper bound, using a random prime, is the following 
result of Rabin, Simon and Yao (see [llj): 

Claim 3.16. R^{EQ) < A\ogn + 2 + 1. 
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Proof. We pick a p prime at random from n^ to 2n^. This is done by A's random string. 
She sends to B p and x mod p. This requires 4 log n + 2 bits. B compares y mod p and 
X mod p. If they are the same, he answers 1, otherwise 0. 

To show that this protocol is correct, we only have to show that there is a good chance 
that a random p will not divide x — y unless x = y. In fact, because of |x — ?/| < 2", it 
can have at most log^iT^ = n/{2logn) prime divisors that are bigger than n^. And we 
know from number theory that the number of primes between n^ and 2?t,^ is approximately 
2?2^/logg(2n^) — 'ri^/loge(n^) > cn^/ (2 log n) where c is a fix constant. Thus the probability 
that p divides x — y is smaller than ^"2 ,^°^"^ = 1/cn, so R^,^{EQ) < 41ogra + 2 + 1. D 

There is another method using a prime number and giving the same result but the 
difference is that this one needs only one fixed prime (depending of course on n). This 
appeared in the book ([8] pp. 30-31). We are going to introduce a parameter to get different 
upper bounds for different error-tolerance. 

Claim 3.17. i?0/„(EQ) < 21ogm + 2 + 1. 

Proof. Pick any prime p from m to 2m. (The existence of such a prime follows from Cheby- 
shev's theorem.) We represent the inputs as polynomials over GF{p): A{z) = Xn-iZ^~^ + 
. . . XiZ + Xq and we similarly obtain B{z). Now A picks a random number Zq over GF{p). 
She sends zq and A{zq) to B. B compares A{zq) and B{zo). If they are different, then the 
answer must be 0, otherwise he answers 1. If A{zo) = B{zo), but A{z) ^ B{z) then zq is the 
root of A{z) — B{z) what is a polynomial with degree <n—l. The chance that we picked 
a root at random is less than n/p < n/m. So we obtained Rn/m — 21og?7i + 2 + 1. D 

Choosing m = 2n we have: 

Corollary 3.18. R^{EQ) <21ogn + 4 + l. 

This is worse with a factor of 2 than Corollary I3.15[ but it is a bit more constructive; 
if we have to work with a fixed n (usually this is the case in applications), we can include 
any p in the algorithm, A does not have to search a random prime again and again like in 
the previous algorithm. Another advantage compared to the previous algorithm is that we 
can quickly repeat the test by sending a new random element from GF{p). For a fix error 
tolerance e choosing m = \n/ -(/e] and repeating it k times this gives: 



Theorem 3.19. R^^{EQ) < {k+\) logn+^i^ log l/e+0(l) using a constructive algorith 



m. 



If e > 1/n, this bound is the strongest in the case when /c = 1, so for a "big" e this 
algorithm should not be repeated at all, it is better if we pick a small m. In this case we 
obtain: 

Corollary 3.20. R'^{EQ) < 21ogn + 21og 1/e + 0(1) using a constructive algorithm. 

If e = 1/n^ then we have to choose k ^ v«, this yields approximately (d + 2a/5 + 
1) logn = log 1/e + 2-y/log 1/e + logn bits of communication. 
If e is even smaller then we can only get a superlogarithmic bound. 
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Now we examine a generalization of the Partitioning Construction that gives a better 
upper bound for a "big" e error-tolerance. Instead of dividing the set of inputs into 2 parts, 
we divide them into k parts and combine it with Claim 1X71 

Pr[x and y are in the same partition] = 1/k. To send which partition an input belongs to, 
requires logfc + 1 bits. Thus R^f^ (EQ) < \ogk. If we use Claim [3?7t we get R^_^-^,j^{EQ) < 
log k + log n + 2 log 1/6 + 0(1). How to choose k and 5 for a given e error-tolerance? We 
have 6 + 1/k = e, and our goal is to minimize log k + 2 log 1/6. We can simply do this 
by transforming and applying the arithmetic-qubic mean inequality: logfc + 2 log 1/5 = 
-log(^) = 1 - log(J,) > 1 - log((^^^)3) = 1 - 31og(|) and the equality holds iff 
6 = 2/k. So e = 3//c, and for the overall complexity is logn + 3 log k + 0(1). If we have an 
e, we can pick k = |"3/e] . This gives the following upper bound: 

Theorem 3.21. R°,{EQ) < logn + 31ogl/e + 0(1) 

This is slightly better than the one we got by applying the prime-method if e > 1/n 
and does not need any prime testing or number theory at all. It is less constructive in the 
sense that we have no idea at all how to find suitable random strings that we got from 
Chernoff 's inequality. But if we are working with a fix n, then we can include the necessary 
random strings, therefore this is the best of all the above algorithms if e is bigger than 1/n 
and this seems to be realistic in most applications. 
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4 The Direct-sum problem 

4.1 The Problem 

The Direct-sum problem arises not only in CC but in almost all computational models 
and is yet unsolved basically in all of them. It was introduced to CC by Karchmer et. al [6]. 
The problem is simply this question: Can it be easier to solve two independent problems 
at the same time than solving them one after the other? 

In CC, the problem has several versions. To state them in a nice form, we introduce first 
some notations. If we have two functions, / and g, and both A and B have two inputs: x/, 
Xg and yj, yg and they would like to compute both f{xf,yf) and g{xg,yg) then we denote 
this problem hj f x g. In the case where they would like to know only f{xf, yj) Ag{xg, yg) 
we denote it by / A g. The problem is also interesting when / and g are the same; in this 
case they both hold k inputs {{xi, . . . ,Xk) and {yi, . . . ,yk)) and would like to compute 
f{xi,yi) for every i. We denote this version by x^/. In the case where we want to know 
only whether all the outputs are 1, we denote it by Akf- 

Now we can state the following conjectures: 

Conjecture 4.1. 

(z) D{fAg) = D{f) + D{g) 
(ii) D{f X g) = D{f) + D{g) 
(ttt) D{Akf) = k ■ D{f) 
(iv) D{Xkf) = k-D{f) 

Obviously the right-hand side is always bigger than the left. 
It is clear that (i) is weaker than (ii) and (iii) is weaker than (iv), moreover (i) is weaker 
than (iii) and (ii) is weaker than (iv). So (iv) is the strongest and (i) is the weakest. 
Although we can not prove for general functions any of them, we know that they hold for 
many specific functions, like EQ, GT or DISJ. Another useful definition of the topic is 
AMT{f) = lim/;_!.oo -D(x/j/)/fc, it is called Amortized Time Com,plexity. 

4.2 Constant Factor Difference in the Direct-sum problem 

Here we give a few counterexamples for the current form of Conjecture 14.11 and then 
modify it to a form for which we cannot present any counterexamples. 

If / is EQ for N = 5 (they both hold a number from 1 to 5), then D{f x f) = 5 + 2 ^ 
2 ■ (3 -|- 1) = 2 ■ D{f). This counterexample works because the information is sent in bits, 
we can not get rid of Base 2. If we were allowed to send instead of bits any arbitrary 
amount of information (but of course the amount of information should be specified in the 
protocol), we could correct this mistake. The cost of one step would be the logarithm of 
the information send. We denote this complexity by D{f). 
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Eg., in the previous case, A could send her number first (costing log 5) and then B 
could send her back the result, this implies D{f) < log 5 + 1. In the Direct-sum case, the 
cost would be log(5 ■ 5) + 2 = 2 ■ D{f), so our counterexample does not work for this little 
modification. 

However, this is not the only type of counterexample that we know. Consider the fol- 
lowing example: Let A's input be 2 + 4 bits and B's be 1 + 4 + 16 bits. The first 2 bits of 
A are pointing to a bit in the 4-bit block of B, the other 4 are pointing to a bit from the 
16-bit block of B. If the first bit of B is 0, then the value of the function is the bit from 
the 4-bit block, if it is 1, then the output is the bit from the 16-bit block. We shall denote 
this / for later reference by TAB(2 — 4) (ThereAndBack 2 — 4). It is easy to see that 
D{f) = 5 -|- 1 and we cannot improve significantly even if we do not insist on the bit-wise 
communication but allow an arbitrary amount of information in each step. 

But we have a solution for / x / in 9 + 2 steps: 

- If both of the first bits of the inputs of B are 1, then he sends a 0, then A replies 
with 2 times 4 bits, finally he sends back the result (2 bits), this altogether yields 11. 

- If one of his first bits is 1 while the other is 0, then he sends 10, then a or 1 
depending which input contains the 1. She replies with 4 + 2 bits and he sends back 
the result (2 bits), this altogether yields 11. 

- If both of the first bits of the inputs of B are 1, then he sends 11. She can reply with 
2 + 2 bits, he sends back the result (2 bits), this altogether yields only 8. 

Note that with counterexamples of these kinds one may achieve only a constant factor 
difference in complexity, so if in Conjecture 14. II we would write an Q before the right-hand 
side, they would not disprove the conjecture. To avoid these counterexamples and have 
a conjecture that does not need Qs, we are going to introduce a new model that differs 
from the classical one only in a constant factor and has no counterexamples at all for 
Conjecture 14.11 However, we are only going to deal with the x case (the (ii) and (iv) parts 
of the conjecture). 

4.3 Fluent Communication 

Here we introduce a new model of communication that is slightly different from the 
classical one. We are going to refer to the classical one as Bit-wise and to the new one as 
Fluent. The basic idea is that someone might send a piece of information faster than 1, 
but for the following sacrifice; if the information is not the one she wanted to send fast, 
she has to send it slower then 1. First, we are going to give a simple example how it works, 
then it will be followed by the exact definition. 

Eg., in the first step of a protocol A would like to send a single bit to B. If the bit is 
0, then the function is computed and the communication is over but if it is 1, they have 
to go on. Now in the case the bit is 0, she has time, but if it is 1, she should hurry to 
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reduce the length of the worst case. We allow her to pick two numbers, a and 6, such that 
1/a + 1/6 = 1 and she can send the 1 in log 6 while the should be sent in log a time. If h 
is smaller then 2, they can finish the communication earlier. 

Definition 4.2. The Fluent Communication is very similar to the classical one. But in 
each step instead of sending a single bit, the player can do the following: First, she has to 
pick an n >2 natural number (she wants to send an information of size n) and ai, . . . , a„ 
reals such that l/ai + . . .+l/a„ = 1. She can send the first type of information in logai time, 
the second in log 02 time etc. (She cannot pick the numbers in the middle of the algorithm 
but she has to pick them before they start to compute the function, so the numbers are built 
into the protocol.) 

Note that if she could pick the numbers during the algorithm, then she could send 
information by what numbers she is choosing, we do not want to allow this. If n = 2 and 
Oi = a2 = 1/2 in each step, then we get the Bit-wise communication. 

In fact, we can assume that n = 2 in each step. We present a way how to make n 
become n — 1 (if n > 3) without increasing the time: If in a given step the numbers are 
ai , . . . , a„ , then instead of them we can choose the numbers ai and -n — , '*',-, , — and in the 

J-' ' "■' ^ l/a2+...+l/a„ 

next step n — 1 numbers: 02(1/02 + . . . + l/on)? ■ ■ ■ 1 an(l/o2 + . . . + l/on)- The sum of the 
reciprocals is indeed 1 and log ^, +^ +i/a — '" log(aj(l/a2 + . . . + l/on)) = logCj, hence we 
are done. 

Corollary 4.3. The Fluent complexity is always at most the Bit-wise complexity. 

Another thing to assume is that A and B are switching among each other until the 
protocol is finished. This is exactly the reverse version of the previous method, we can 
increase n to n + 1 if it is A's turn again after one of the n possibilities, the proof goes in 
exactly the same way. Of course we can assume only either this or the previous version. 

Now we are going to show how the fluent algorithm works for TAB {2 — 4). We have to 
use the "Fluentness" only at the beginning. If the first bit of B is 0, he sends it in log 5 time 
while he sends it in log 5/4 time if it is a 1. (Since 1/5 + 4/5 = 1, this is a correct step.) 
In the 0-case, they need 2 + 1 more bits of communication, in the 1-case 4 + 1 more, both 
yielding a total 4 + log 5/4 time, better than the Bit-wise. Moreover, 2 ■ (4 + log 5/4) < 11, 
the number of bits needed to solve two copies of TAB(2 — 4), so it can even enhance the 
solution of two copies. 

Claim 4.4. The time needed for Fluent Communication equals to log C^{f). 

Proof. If we have a protocol-tree, we can easily construct a Fluent protocol. At each node, 
we count the number of leaves under its two sons. If they are a and b, our two numbers 
shall be {a-\-b)/a and {a + b)/b. This way the time of the total communication by induction 
on the number of leaves is log a + log((a + b)/a) = log b + log((a + b)/b) = log(a + b), just 
what we wanted to prove. 

Showing that a fast Fluent protocol gives a protocol-tree with few leaves is a similar 
argument as the one before. Now the induction goes by the number of steps in the protocol. 
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Let us suppose that the protocol finishes in log r steps in the worst case. Let the numbers of 
the first step be ai, . . . , a„, the number of the leaves in the remaining part of the protocol be 
Li, . . . , Ln, respectively. The induction gives Vi logLj + logOj < logr. After transforming 
l/ttj > Li/r. Summing up and using ^ l/cLi = 1 we get: 1 > X] ^i/r, thus r > ^ Lj = the 
number of leaves of the protocol-tree we constructed, so just what we wanted to prove. D 

Combining with Claim 11.121 we have proved again Corollary 14.31 

By the nature of the Fluent Communication, none of our old tricks for disproving 
Conjecture 14.11 works. We can finally restate the conjecture in the desired form: 

Conjecture 4.5. CP{xkf) = k ■ CP{f) 

Note that CP{Xj.f) < k-CP{f) is trivial. Now we are going to prove that the Amortized 
Time Complexity of / can be at most the Fluent computation of /: 

Theorem 4.6. AMT{f) = \imk^^ D{xkf)/k < logC^if) 

Proof. We are proving by induction on CP{f). We assume that D{xkf') < gi{k) ■ log/ for 
all / < CP{f) where gi{k) < k + ci and q is a constant depending only on / = CP{f'), the 
number of leaves of /'. If we prove that this holds for / = C^(/) as well, we have proved 
the theorem. 

After sending the first bit, we have two easier problems to solve, let us denote them 
by fm and /„. We shall denote Cp(/) by L, Cp{U) by M and Cp(/„) by N. Note that 
obviously M and A^ are both less than L, so we can use the induction. We would like to 
prove the following lemma for a suitable giik)'- 
Lemma 4.7. 2L5'i('=)iog(A-^+^)J > ^ (k\2\9M{i)^ogM^+\gN{k~i)\ogN^ 

Proof. The proof only consists of some calculation. First we replace the lower and upper 
integer-parts by -1 and +1: 29i('=)i°g(^^+^)-i ^ ^ ^fc^ 23^(0 log M+i+gjv(fc-i) log a^+i. Now we get 

rid of the base two: 1/2 (a + 6)9^(^) = /^Y,{^^M3m(^N3^^^-^\ The right side is bounded by 
induction by 4^ (J) M^+^*^A^'^-*+'=^ < 4(M+A^)'^M'="A^'='^. Thus we need: 1/2 (a+6)»^W > 
4(M + NfM^^'N''^ to hold for all M + A^ = L. If we choose gL{k) = mmM+N=L{k + Cn + 
Cm + log 8), our inequality follows. D 

To give a protocol for x^/, we are going to use the ones for x^f^ and Xkfn- Wlog. 
we can suppose that M < A^, so /„ is the tougher problem. In the case, when we are 
going to have i out of the k inputs going toward m and k — i toward n, the first half 
of the message will contain the information which message is going which way, we call 
these messages respectively n- and m-cases. The second half will be exactly of length 
igM{i) logM + (fc — i)gN{k — i) logN giving enough space to communicate the problems 
one by one by induction, we have no problem how to do this part. In the first part, we 
send giik) log(M + A^) — kg^ik) \ogN Os if all k problems are n-cases. So the first half of 
the message will be something like this: 0000. If we have only 1 m-case and k — 1 ra-cases, 
then we use the first giik) log(M + A^) — (7a/(1) — {k — l)g^{k) logN bits to specify where 
the r/i-case is. So if it is the first one, our message should be something like this: 000100 
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(we have to skip 000000 because all messages starting with 0000 are reserved for the full 

n-case). If it is the second one, something like this: 000101 etc. It is guaranteed that we do 

not run out of space by the previous lemma; if we run out, then the sum on the right-hand 

side until that i where we run out of space would exceed the left-hand side yielding a 

contradiction. 

We leave the exact details of the proof to the reader. D 

4.4 Communication with Partial Information 

It is possibly to define everything in communication complexity if we allow the players 
to have a partial information at the beginning of the game depending on their input. In 
this case the input-pair {x,y) e S* C X x y . Surprisingly, we know that the Direct-sum 
conjecture does not hold in this case. This model was studied by Orlitsky but we follow 
the book ([8j pp. 63-66) that follows the paper by Feder et al. jl]. 

We can imagine a partial communication game as a matrix filled with Os, Is and *s. 
The *s denote that (x, y) ^ S, the player cannot have this input-pair. Let us denote such a 
problem by f^. If we fill in all the starred places by Os and Is, we get a particular classical 
problem. We denote the set of these problems by J-". 

Claim 4.8. D{f^) = minf^^D{f). 

Proof. If we have a protocol for any / G J-", then it is a protocol for f^, as well. 
On the other hand, if we have an optimal protocol for /^,, then it gives a rectangle-partition 
of the whole matrix X xY where all rectangles are monochromatic, meaning they contain 
either only Os and *s or Is and *s. If we fill in the * entries by Os and Is regarding which 
rectangle they belong to, we get an / G J-" for which the same protocol works. D 

Now for a while, instead of examining the common 0-1 range case, we will allow func- 
tions whose range is different, namely f{x,y) = x. While D{f) = log|X| and is of no 
interest in the standard model, we can examine it in the case of partial information where 
the complexity will depend on S. We denote the complexity of such a problem by D{S). 

Let us consider the following problem: X = {0, 1}", Y = (2)) thus A has one num- 
ber, while B has two. We are going to denote B's input y by [u,v]. S = {x, [u,v] : x = 
u or X = v}, thus he knows two numbers and she knows one of them and he has to find out 
which. The name of this problem is NBA because you can think about the inputs as the 
names of teams. B is really interested in basketball and knows which two teams played last 
night and would like to know who won. A heard the winner in the news but she does not 
know which two teams played yesterday. How much information is needed to be exchanged? 

A great advantage of this problem is that we can also think about it as a classical 0-1 
range problem; knowing whether m or f equals x is equivalent for B to know whether the 
first or second of his teams is the winner. Now let us examine the complexity. 
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If only she is allowed to speak, she has to send n bits, otherwise there would be two 
unseparated elements of X. We denote this by ]j°'^^-'^°-y (^s) = n. But if both of them can 
speak, they can be faster. 

Claim 4.9. D{NBA) < logn + 1. 

Proof. First B sends A an index i for which Ui 7^ fj. This requires logn bits. Now she can 
send back Xj, this reveals for him which is the winning team. D 

Can this problem be solved faster? The answer is no and it follows from the following 
Claim: 

Claim 4.10. D{S) > logD°"^-"'"^(5). 

Proof. Suppose that their is protocol with D{S) steps. We construct from it a one-way 
protocol with 2^*^'^) steps. For each possible 2^(5*) bit-strings, A sends whether they are 
possible or not. (Whether her input intersects the leaf belonging to that bit-string or not.) 
From this and y, B can determine the only possible bit-string. This reveals the answer 
because the original protocol was good. D 

Let us denote the case when first B is allowed to speak, but after he finishes, only A 
can speak. We denote this by D'^{S) and call it a two — round communication problem. As 
we have seen in the proof of Claim Hl9| D'^{NBA) = logn + 1. Now we are going to prove 
an interesting theorem, that is D'^{S) = 0{D{S)). For the proof, we need some preparation. 

For a problem 5*, we define a hypergraph Gs = {X, E) as follows: For every y & Y 
there is a hyperedge Cy = {x : {x,y) G S}. A coloring of Gs with c colors is a function 
i/j : X ^ {1, . . . ,c} such that for every hyperedge its vertices have all different colors. The 
minimal number of colors needed is the chromatic number of Gs, we denote it by x{Gs)- 
The size of the largest hyperedge is the degree of Gs, we denote it by d{Gs)- It is not hard 
to see that 

\logx{Gs)] = D"— »^(5) > D\S) > DiS) > logidiGs)) 

Now we need a technical claim, that states a suitable family of /las/i-f unctions exists. 

Lemma 4.11. Let m andt he two arbitrary integers. There are constants G and 5 such that 
there exists a family Hm,t that contains k = 6t log m functions whose domain is {1, ... , m} 
and whose range is {1, . . . ,p = Gt"^} such that for every A C {1, . . . , m} of size at most t, 
at least half of the functions from Hm,t o-f^ injective over A. 

Proof. This is a technical proof using a probabilistic argument. We choose k functions, 
hi, ... , hk, at random whose domain is {1, . . . , m} and whose range is {1, . . . ,p} where G 
is going to be fixed soon. For a fix A C {1, ... , m\ of size at most t, the probability that 
a random function is injective is at least 1 ■ ^^ . . . Putd. > (1 — -)* = (1 — ^)* > 3/4 if 
G is big enough (C = 0(1), so it does not depend on t). Let Zi denote the indicator of 
the event that hi is injective over A. We know that Pr[Zi = 1] > 3/4. Using Chernoff's 
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inequality, we obtain Pr[{l/k^^^^ Zi) — 3/4 > — x] < e~^ '^^'^. If a; = 1/4, the right-hand 
side is e"®*^^-*, this is the chance that half of the functions is injective over A. There are 
< m* possible t element subsets of {1, . . . ,'m}. So if m* ■ e^®'^'^^ < 1, then there exists a 
suitable family of functions and we can achieve this by choosing 5 small enough, depending 
on m and t. D 

Theorem 4.12. D'^{S) = 0{D{S)). 

Proof. We construct a two-round protocol using Gs- We fix a coloring ip of Gs with x{Gs) 
colors and fix a family of functions H = H^(^Gs)AGs) satisfying the conditions of the pre- 
vious Lemma. The colors used to color the vertices of the edge Cy determine a subset 
A C {1, . . . ,x{Gs)} that has size at most d{Gs), thus it satisfies the condition of the 
lemma, therefore there is a function h E H that is injective over A. 

The protocol is simple. A sends the name of /i, then B sends back h{il){x)) and they are 
done, he knows x because of the injectivity. 

Sending h requires log \H\ = \og{6d{Gs) logx{Gs)) = log 5 + \ogd{Gs) + log\ogx{Gs) 
bits. Sending h{jp{vx)) required \og(C{d{Gs)Y) = logC + 21og(i(G5) bits. This is to- 
gether 0(l) + 0{\ogd{Gs))+\og\ogx{Gs)- But we know that \ogd\Gs) < D{S) and also 
\\ogxiGs)] = D°^^-^''y{S) and logD°"""'""?'(5) < D{S). Therefore this is in fact 0{D{S)) 
bits of communication. D 

Note that we can see from the proof that we have not used that half of the functions 
from H are injective. This suggests that if they have to solve several problems at the same 
time, they might be faster because B might find a function that is injective for many of 
his inputs at the same time. This is in fact true. 

Theorem 4.13. D'^^x^S) = 0{k\ogd{Gs)) + log A; ■ \og\ogx{Gs))- 

Proof. The protocol is almost the same as in the previous theorem. When choosing a hash- 
function, B can choose an hi E H such that it is injective for at least half of his inputs. 
Then he chooses &n h2 E H injective for at least half of his remaining inputs etc. He sends 
the names of this log k functions, then he sends which inputs belong to which functions. 
Finally A sends back every hj(i){il){xi)) where j{i) is the index of the proper hash function 
for each Xi input. 

Sending the names of the functions requires 0(log A;(log(i(G5) -|- loglogx(G'5)) bits. 
Sending which input belongs to which function requires k log log k bits, but we can use a 
better prefix coding; if an input Xi belongs to the jth function, he sends j — 1 Is followed 
by a 0. Because at most 1/2-' inputs belong to hj, this is at most X]i°^ jk/l^ = 0{k) bits. 
Sending back the hash-values takes 0{klogd{Gs)) bits. Altogether this is what we wanted 
to prove. n 

Now let us consider the case when S = NBA. We know that this is equivalent to 
a 0-1 range problem with partial information. It is easy to see that xiGNBAJ = 2" and 
diGjvBA) = 2. Hence D{XkNBA) = 0{k + \ogk-\ogn) « klogn = kD{NBA). Therefore 
the Direct-sum conjecture is not true in the partial information case. 
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Note that this does not give us a counterexample in the standard case using Claim I4.8[ 
because we can fill in the *s with Os and Is in the direct-sum version that it does not be- 
come the direct sum of any standard function, so the classical conjecture still remains open. 



4.5 The Direct-sum problem in the Randomized case 

We can ask the same question in randomized case. The best way to ask is the following: 
Is it true that i?^"''(xfc/) = k ■ BF'^^[f)l (Again, we are not interested in constant factor 
difference.) Here, not even KP'^^{Xkf) < k-Rf^^{f) is trivial because the chance of mistake 
commulates. We can only state BF'\xkf) < k-R^I'^^^f) < 0{k\ogk- RP^\f)) using Claim 
13.51 In this section we are going to prove that -R^"^(Xfc/) = 0{k ■ Rf^^{f)) for / = EQ. 
This section is completely our result. 



Theorem 4.14. RP'^^XkEQ) = 0{k). 

Proof. First note that it is sufficient to prove RP^'^{XkEQ) = 0{k) for any e because of 
Claim 1X51 Another useful thing, that it is enough if we present an algorithm with expected 
running time 0{k). Then using the Markov inequality we can get rid of cases when it is 
running for more then 0{ck) increasing the error by at most 1/c. 

Claim 13.141 gives R^'P'^^{EQ) = 1 + 1. Moreover, we know that ii x ^ y, then the 
chance that the answer is wrong is exactly 1/2. We denote the bit sent by A by TEST{x) 
and the bit for which B replies 1 (accept) by TEST{y). Note that of course when that 
TEST uses a different random string for each Xi. Also note that when they test again for 
the same Xj, TEST{xi) denotes a different bit. The reader may always easily figure out 
when a new test is applied. (Usually in the same paragraph TEST denotes the same thing.) 

First A sends TEST{xi) for each i. B compares these with each proper TEST{yi) and 
sends back which of them are equal. This takes 0{k) bits of communication. 

For some pairs it turns out that they are not equal, we can forget about them. We 
group the rest into pairs and for them, she sends TEST{xi) © TEST{xj). ('©' denotes 
the sum mod 2.) He compares this with TEST{yi) © TEST{yj) and sends back whether 
they are equal or not for all pairs. This takes 0{k/2) bits of communication. 

Again, if they are not the same, we know for sure that either Xi ^ yi or Xj ^ yj. 
Moreover, the chance that he detects this is exactly 1/2. If TEST{xi) © TEST{xj) ^ 
TEST{yi) (BTEST{yj), then she sends the same TEST{xi) (without adding TEST{xj) 
to it). If this does not equal TEST{yi), then we know for sure that Xj 7^ yi. If they equal, 
then TEST{xj) 7^ TEST{yj), thus Xj 7^ yj. We call this a track-back. In both cases we have 
excluded a wrong input-pair and we perform a TEST for the other pair; if the TEST de- 
tects a difference again, we know that the other was wrong as well, if it satisfies the TEST, 
we can be more certain that they equal. Discovering each wrong input-pair requires 0(1) 
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bits of communication, and after this second step, we have tested each remaining input- 
pair at least twice. 

The algorithm goes on like this; we make fours, eights and so on until all the k in- 
puts are tested together. (We can suppose that k is a power of 2.) This requires 0{k) + 
0{k/2) + 0{k/4) + . . . + 0(1) = 0{k) bits of communication without counting the track- 
backs at detecting a wrong pair. A track-back at the Ith round requires 0{l) bits. The 
chance that a wrong pair is detected in the Ith round is exactly 1/2'. So the expected cost 
of detection is Yli^ V^' = 0(1). The number of wrong input-pairs is at most the num- 
ber of input-pairs, hence < k. Therefore the expected running time is at most 0{k) indeed. 

The chance that a fixed wrong input-pair is not detected is exactly 1/2'°^'^ = l/k. 
But if instead of testing in each round only once, we can test twice. This doubles the 
bits sent but halves the chance of erring. The number of bits sent during a track-back 
reamins the same. The chance that a fixed wrong input-pair is not detected becomes 
1 — ^ °^ 3/4' = l/4^°g'^ = l/fc^. So the chance that there is any undiscovered wrong input- 
pair is at most k/k"^ = l/k < 1/2. Therefore we have proved R^'P'^^{XkEQ) = 0{k). D 

Note that obviously the complexity is at least k because the answer is k bits. So we 
know that W\xkEQ) = Q{k). 

Using a certain random protocol for GT (see [H| pp. 170-171) a similar argument shows 
that W^{XkGT) = 0{k ■ RP'^\GT)). This suggests the following conjecture: 

Conjecture 4.15. BF''\xkf) = 0{k ■ RP'^^f)) 
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5 Complexity Classes 

In the first subsection we follow the book ([8] pp. 58-59), the other subsections are 
completely our results. 

5.1 Classes 

We can categorize communication complexity problems just like computational com- 
plexity problems into classes. In fact, we can define all important classes in CC as well. 
However, because here every problem is solvable in n + 1 time, the classes are rather 
compared to polylog(n) than to polinoms of n. The basic classes are: 

P- = {/:Z}(/) = polylog(n)}, 

iVP- = {/:iVi(/)=polylogH}, 

coNP^" = {/ : N^if) = polylog(n)}, 

5PP- = {/:i?(/)=polylog(n)}, 

i?P- = {/:i?i(/) = polylogH}, 

coi?P- = {/:PO(/)=polylog(n)}. 

Note that Rpp^'>^^^ = RP^^ because of Lemma O 
Of course these classes are related to sequence of functions for all n-s, not for a single 
function. Unlike in computational complexity, here we can prove or disprove almost all 
relations among these classes. We summarize the known results below: 

Theorem 5.1. Hierarchy of Communication Complexity Classes: 

(0) P^^ C RP^'' C NP^^. 

(1) P^^ = NP^" n coNP''^. 

(2) P"^ ^ NP'"' ^ coNP"". 

(3) P"^ ^ PP^^ ^ coRP'"'. 

(4) coNP^^XEPP^^ ^ 0. 

Proof. (0) follows from D{f) > R\f) > N\f). 

(1) is known as the Aho-Ullman-Yannakakis Theorem [Tj. 

For (2) and (3), functions NE and EQ are the counterexamples because D{EQ) = 
D{NE) = N^{EQ) = N°{NE) = n + I, while N^{NE) = N°{EQ) = logn, R\NE) = 
W{EQ) = O(logn), R\EQ) = R^{NE) = fi(n). 

For (4), the DISJ function is the counterexample. N^{DISJ) < \ogn is trivial while 
R{DISJ) = il(n^''^) was first proved by Babai et al. [2], then it was improved to f2(n) by 
Kalyanasundaram and Schnitger [5], later this proof was simplified by Razborov [15]. D 
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To study further relations among complexity classes, it is useful to define reducibility 
and completeness, as in the book ([8j pp. 58-59): 

Definition 5.2. Let the inputsize of f he n and the inputsize of g he m where m = 
2Poiyiog(n)^ / IS reducihle to g if a pair of functions h^ : {0, 1}" -^ {0, 1}™ and hy : {0, 1}" -^ 
{0,1}™" exist such that f{x,y) = 1 <^ g{hx{x),hy{y)) = 1. For a class C, a series of 
functions g (the same function with different inputsizes) is C -complete, if g E C and if 
every f E C is reducihle to g (with the proper inputsize). 

It can be easily proved that DISJ is co A^P'^'^-complete. We omit the proof here, because 
in Section [5751 we will give an equivalent definition of reducibility and prove this statement 
there. 

We can define the analog classes of the polynomial hierarchy: 

Sq'^ = {/ : The communication matrix of / is 1 except a single rectangle that is filled 
with Is }. 

IIq'^ = coSq"^ = {/ • The communication matrix of / is except a single rectangle 
that is filled with Os }. 

nr+i = COS- , = {/:/ = ^t^'^' h. h e s^} 

It can be easily proved that H'^ = NP'^'^ and similarly II^'^ = coNP'^^. We know complete 
problems for all of these classes (to be discussed in Section [O]) . although we still do not 

7 

know whether T,^ = Il2'^. 

5.2 Space-bounded Communication 

We can even define a corresponding class to PSPACE. To do this, first we have to define 
the cost of communication measured in Space instead of Time. This class might be useful 
for proving the existence of special Oracles or maybe even used to give a lower bound for 
the Space used by a Turing machine computing a certain function, like we can use EQ to 
give a quadratic lower bound for the Time used to decide the language of Palindromes on 
a one-tape Turing machine (see [3]). 

We redefine the communication in the following way: A and B are still supernatural 
beings capable of computing anything but now they only have a limited amount of memory 
and that is common. The minimum size of this common memory that they can use to 
evaluate the given function / shall be denoted by 5'(/). At the beginning it is filled with 
Os. Then in each step one of the players can put there an arbitrary message depending 
only on the previous message and his input. They are finished when both of them knows 
the value of f{x,y). We can also imagine this as two people communicating who have no 
memory at all (however, they can remember their own input) and are allowed to send each 
other a rewritable disk. The question is how big the disk has to be if both of them wants 
to know the value of f{x,y). Let's see an example. 
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Claim 5.3. S{EQ) = log(n) + 0(1). 

Proof. We present a construction. A sends her bits one after the other along with their 
ordinal number and a leading 1, meaning that it is up to B to speak. B replies to each 
message with his bit with the same ordinal number and a leading 0. This requires 2 + logn 
space. If in a step his bit differs from her, they know that the answer is 0, the algorithm 
is over. If they finish sending all their bits, the answer is 1. Therefore, S{EQ) < logn + 
0(1). D 

To get lower bounds for S{f), we need the following lemma: 

Lemma 5.4. S{f) > A(D(/)) > \ogD{f) - \og\ogD{f). 

Proof. It is sufficient to show that if we have a protocol using S{f) space, then we can 
construct an another one using S{f)2^^^^ time. The new protocol is the following: A sends 
in order all her replies to all possible messages. After this B can easily simulate the old 
protocol by himself. So this takes S{f)2^^^^ time, just what we wanted. D 

Combining methods like the one presented in Claim [5751 with Lemma [5.41 we get: 

Corollary 5.5. 

S{EQ) = e{\ogn). 
S{GT) = Q{\ogn). 
S{IP) = e{\ogn). 
S{DISJ) = &{\ogn). 

Moreover, in fact we have proved S'(/)/logn — ;■ 1 for all of the above four functions. 



Note that in the proof of Lemma 15.41 we used only that A has no memory. In the case 
when B can remember everything, we denote the required space instead of S{f) by S^{f). 
Obviously S{f) > S^{f). The corollary of Lemma [5.41 

Corollary 5.6. S^if) > A(^(/)). 

But we can improve the upper bound for S^: 

Claim 5.7. S^{f)2^^^f'^ <n + 0{l). 

Proof. We have to give a protocol that is using only b space where 62^ = n. We divide up 
the n bits of A into n/b disjoint blocks, each of size b. In each step B sends the name 
of a block and A sends back that block of hers. At the end B knows everything, he can 
compute the function, this requires max(6, logn/6) = b space, just what we wanted. D 

Combining the previous corollary with the previous claim we get: 

Corollary 5.8. 

5^(Eg)2^^(^«)=n + 0(l). 
5^(GT)2^-'(«^) = n + 0(l). 

34 



SA{^IP)2S^{iP) =n + 0{l). 
^^(Z)/5J)2^^(^^^-^)=n + 0(l). 

The exact value of S{EQ) is yet unknown. 

These resuhs suggest to define the following classes: 

SPACE'^%g{n)) = {f:S{f)=g{n)} 

PSPACE^^ = U^^^ SPACE''%log''{n)) = {/ : 5(/) = polylog(n)} 

So far we were unable to find complete problems, however, we believe that there exists 
some. 

5.3 Oracles 

We can even introduce Oracles in communication complexity. Doing so, we have to mix 
the fact that they both know only their own inputs and yet asking the Oracle, it should re- 
spond some useful information. To achieve this, at each question they ask simultaneously; 
whether they raise a question or just communicate, is built into the protocol and their 
questions depend only on the previous communication and their own input. The Oracle is 
a function g{x', y') whose inputsize is m where m = 2"^°^^^°^^^^ When they ask a question, A 
determines x' from her input and the previous communication and similarly B determines 
y\ The Oracle's answer is g{x',y'). Each question counts to be 1 bit of communication. 
(Or a space on the tape in the Space-bounded case.) 

Eg., let the Oracle accept inputs of length n and be able to say GT{x',y') (whether 
its first input is bigger then the second one with the usual ordering). Now A and B can 
compute EQ in 2 steps. First they input x and y, then x and y. If both answers are 1, 
then X = y, otherwise not. We denote this by D'^^{EQ) = 2. 

Another example: The Oracle accepts inputs of length 2n and is able to say DISJ{x', y'). 
Now they can solve the problem with a single question. A inputs xx and B inputs yy. 
This implies D'^^^-^EQ) = 1. 

Now we give an equivalent definition to reducibility (Def I5.2p : 

Claim 5.9. Let the inputsize of f ben and the inputsize of g be m where m = 2P°^y^°s("\ 

/ is reducible to g iff D^{f) = 1. 

(Unless g is constant, further on we are going to assume this.) 

Proof. If / is reducible to g, then they ask hx{x) and hy{y). The Oracle's answer will equal 
to fix,y). 

If D^{f) = 1, it means that they communicated only a single bit or asked only one question. 
The former means that D{f) = 1, hence it is reducible to any g that is not constant. The 
latter means that they ask a question that depends only on their inputs, thus they have 
an hx and an hy function, just what we wanted to prove. D 
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Corollary 5.10. Note that if Ds{f) = I, then f{x,y) = g{h^{x),hy{y)). 

Claim 5.11. V/ G Hf D^^^\f) = 1. 

Proof. The entries of / can be covered hj m = 2p°^^'°s(»^) 0-rectangles. A and B both 
input a m bit long message to the Oracle indicating which 0-rectangles intersect their 
inputs. If these are not disjoint, then the value of the function is obviously 0. If these are 
disjoint, then the value cannot be 0, because all Os are contained in one of the m rectangles, 
thus the answer is 1. D 

Corollary 5.12. DISJ is coNP^'^ -complete. 

Now we are going to give E^^-complete and n^'^-complete functions for all k. In fact we 
are going to show that if we have a given class C and a C-complete function, how we can 

construct a complete function for 3C = {f '■ f = Vj=i /«> fi ^ ^}- Let coDISJf : 

{0, 1}™" X {0, 1}™" -^ {0, 1} be the following function: coDISJf{{xi . . . x„), (yi . . . y^)) = 
1 -^ 3i f{xi,yi) = 1. 

Claim 5.13. If g is C-complete, then coDISJg is 3C -complete. 

Proof. The proof is very similar to Claim 15.111 Let / G 3C be an arbitrary function, the 
inputs are x and y. By the definition of 3C, the set of the 1 entries of g is the union of the 
1 entries of m = 2P°iy'°g(") functions: /i, . . . , /„, Wifi G C, thus D3{fi) = 1. This means 
because of Corollarv 15.101 that for suitable Xi,yi input-pairs f{x,y) = g{xi,yi). Now they 
ask from the Oracle: (xi . . . Xm) and {yi . . . ym)- If the answer is 1, then there is an i such 
that g{xi,yi) = 1, therefore f{x,y) = 1 as well. If the answer is 0, then Vi g{xi,yi) = 0, 
this implies f{x,y) = 0, so this solves our problem indeed. D 

This gives complete functions for J^l^ and 11^^ for every k. Eg., coDISJDISJ is Tj"^- 
complete. It is the easiest to imagine the inputs as two n x n matrices and the question 
is whether the two inputs have a disjoint row. However, it still remains an open question 
whether the classes Sg*^ and U'^'^ are the same or not. 
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